Phylogenetic signal and noise: predicting the power of a data set to resolve phylogeny.

نویسندگان

  • Jeffrey P Townsend
  • Zhuo Su
  • Yonas I Tekle
چکیده

A principal objective for phylogenetic experimental design is to predict the power of a data set to resolve nodes in a phylogenetic tree. However, proactively assessing the potential for phylogenetic noise compared with signal in a candidate data set has been a formidable challenge. Understanding the impact of collection of additional sequence data to resolve recalcitrant internodes at diverse historical times will facilitate increasingly accurate and cost-effective phylogenetic research. Here, we derive theory based on the fundamental unit of the phylogenetic tree, the quartet, that applies estimates of the state space and the rates of evolution of characters in a data set to predict phylogenetic signal and phylogenetic noise and therefore to predict the power to resolve internodes. We develop and implement a Monte Carlo approach to estimating power to resolve as well as deriving a nearly equivalent faster deterministic calculation. These approaches are applied to describe the distribution of potential signal, polytomy, or noise for two example data sets, one recent (cytochrome c oxidase I and 28S ribosomal rRNA sequences from Diplazontinae parasitoid wasps) and one deep (eight nuclear genes and a phylogenomic sequence for diverse microbial eukaryotes including Stramenopiles, Alveolata, and Rhizaria). The predicted power of resolution for the loci analyzed is consistent with the historic use of the genes in phylogenetics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phylogeny of some species of Astragalus (Fabaceae) based on morphological data

The phylogenetic relationships among 39 species belonging to 12 sections of Astragalus from Iran were studied on the basis of 29 morphological characters. The cladistics analysis of the morphological data was performed using PAUP* 4.0b10 program. The obtained data were compared with the molecular systematics data obtained from nuclear DNA ITS. In contrast with previous molecular systematic stud...

متن کامل

Grid Impedance Estimation Using Several Short-Term Low Power Signal Injections

In this paper, a signal processing method is proposed to estimate the low and high-frequency impedances of power systems using several short-term low power signal injections for a frequency range of 0-150 kHz. This frequency range is very important, and thusso it is considered in the analysis of power quality issues of smart grids. The impedance estimation is used in many power system applicati...

متن کامل

The impact of incorporating molecular evolutionary model into predictions of phylogenetic signal and noise

*Correspondence: Jeffrey P. Townsend, Department of Biostatistics, Yale School of Public Health, 135 College St. #222, New Haven, CT 06510, USA e-mail: [email protected] Phylogenetic inference can be improved by the development and use of better models for inference given the data available, or by gathering more appropriate data given the potential inferences to be made. Numerous studie...

متن کامل

Application of Single-Frequency Time-Space Filtering Technique for Seismic Ground Roll and Random Noise Attenuation

Time-frequency filtering is an acceptable technique for attenuating noise in 2-D (time-space) and 3-D (time-space-space) reflection seismic data. The common approach for this purpose is transforming each seismic signal from 1-D time domain to a 2-D time-frequency domain and then denoising the signal by a designed filter and finally transforming back the filtered signal to original time domain. ...

متن کامل

Enhancement of Noise Performance in Digital Receivers by Over Sampling the Received Signal

In wireless channel the noise has a zero mean. This channel property can be used in the enhancement of the noise performance in the digital receivers by oversampling the received signal and calculating the decision variable based on the time average of more than one sample of the received signal. The averaging process will reduce the effect of the noise in the decision variable that will approa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Systematic biology

دوره 61 5  شماره 

صفحات  -

تاریخ انتشار 2012